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CLAIMS 

1 1. An\pparatus comprising: 

2 at leas^one processor; 

3 a memory coupled to the at least one processor; 

4 a cluster engine residing in the memory and executed by the at least one 

5 processor; \ 

6 a job residing in the rnemory and executed by the at least one processor, the job 

7 including: 

8 at least one work tftread that performs at least one predefined task; and 

9 a main thread that receives messages from at least one computer system 

10 coupled to the apparatus, that routes appropriate messages from the at least one 

1 1 computer system to the at least onework thread, and that signals to the cluster 

12 engine when at least one fault occurs Nyhen the at least one work thread performs 

1 3 the at least one predefined task. 

1 2. The apparatus of claim 1 wherein the at leastane predefined task comprises a 

2 protocol that includes at least one acknowledge (ACKjNpund, and that performs only 

3 local processing between ACK rounds. 

1 3. The apparatus of claim 1 wherein the main thread perf^nns only local processing. 

1 4. The apparatus of claim 1 wherein the main thread does not w^ait for any local 

2 resource, and thus is guaranteed to receive a message sent by the cluster engine. 

1 5. The apparatus of claim 1 wherein the signal to the cluster engine comprises an 

2 unregistration with the cluster engine 
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1 6. The apparatus of claim 5 wherein the unregistration with the cluster engine causes 

2 the cluster engine toVenerate a membership change message. 
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1 yf^ A networked computer system comprising: 

2 a cluster of computer systems that each includes: 

3 a network interface that couples each computer system via a network to 

4 other computer^ystems in the cluster; 

5 a memory\ 

6 a cluster engine residing in the memory and executed by the at least one 

7 processor; \ 

8 a job residing in thesmemory and executed by the at least one processor, 

9 the job including: \ 

10 at least one work thread that executes a predefined protocol that 

1 1 includes at least one acknowledge (ACK) round, wherein the protocol only 

1 2 performs local tasks between A€K rounds; and 

13 a main thread that registers with the cluster engine to become a 

14 member of a group, that receives messages from at least one computer 

1 5 system coupled to the apparatus, that routes appropriate messages from the 

16 at least one computer system to the at least one work thread, and that 

1 7 signals to the cluster engine when at least onfe fault occurs when the at 

1 8 least one work thread performs the at least one predefined task by 

1 9 unregistering with the cluster engine, wherein unrbgistering with the 

20 cluster engine causes the cluster engine to generate ^membership change 

2 1 to remaining members of the group. \ 
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1 A computer-implemented method for notifying jobs that form a group in a 

2 clustered computing environment when a member of the group is no longer alive, the 

3 method comprising the steps of: 

4 defining a task; 

5 providing a cluster engine for each member of the group that communicates with 

6 the other cluster engines in the group; 

7 providing at leasryone work thread for each job that executes the task; 

8 providing a main tnread for each job, the main thread performing the steps of: 

9 receiving messages from other members of the group via the cluster 

1 0 engine corresponding to the main thread; 

1 1 routing appropriate messages from the other members of the group to the 

1 2 at least one work thread; and 

13 signaling to the cluster engine when at least one fault occurs during the 

1 4 execution of the task by the work thread. 

1 9. The method of claim 8 wherein the task comprises a protocol that includes at least 

2 one acknowledge (ACK) round, and that performs only local processing between ACK 

3 rounds. \ 

1 10. The method of claim 8 wherein the main thread performs only local processing. 

1 11. The method of claim 8 wherein the mainvthread does not wait for any local 

2 resource, and thus is guaranteed to receive a message sent by the cluster engine. 

1 12. The method of claim 8 wherein the step of signaling to the cluster engine 

2 comprises the step of unregistering with the cluster engine. 
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1 13. The method of claim 12 wherein the step of unregistering with the cluster engine 

2 causes the cluster engine to generate a membership change message to remaining 

3 members of the group. 
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1 ^t#T A computer-implemented method for notifying jobs that form a group in a 

2 clustered computing environment when a member of the group is no longer alive, the 

3 method comprising the steps of: 

4 defining a protocol that includes at least one acknowledge (ACK) round, and that 

5 performs only local processing between ACK rounds; 

6 providing a cluster engine for each member of the group that communicates with 

7 the other cluster engines\ji the group; 

8 providing at least one work thread for each job that executes at least a portion of 

9 the protocol; \ 

10 providing a main thread for each job, the main thread performing the steps of: 

1 1 registering with the cluster engine to become a member of the group; 

12 receiving messaged from other members of the group via the cluster 

13 engine corresponding to theimain thread; 

14 routing appropriate messages from the other members of the group to the 

15 at least one work thread; \^ 

1 6 wherein the main thread performs only local processing and does not wait 

1 7 for any local resource, and thus ^guaranteed to receive a message sent by the 

1 8 cluster engine; \ 

19 unregistering with the clustenengine when at least one fault occurs during 

20 execution of the protocol. \ 

1 15. The method of claim 14 wherein the stepW unregistering with the cluster engine 

2 causes the cluster engine to generate a membership change message to remaining 

3 members of the group. \ 
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1 ^ progranWoduct comprising: 

2 (A) a computer program comprising: 

3 at least one work thread that performs at least one predefined task; and 

4 a mam th^e ad that receives messages from a corresponding cluster engine, 

5 that routes appropriate messages from the cluster engine to the at least one work 

6 thread, and that signal's to the cluster engine when at least one fault occurs when 

7 the at least one work thread performs the at least one predefined task; and 

8 (B) signal bearing media bearing the computer program. 

1 17. The program product of claim 1\6 wherein the signal bearing media comprises 

2 recordable media. \ 

1 18. The program product of claim 16 wherein the signal bearing media comprises 

2 transmission media. \ 

1 19. The program product of claim 1 6 wherein the at least one predefined task 

2 comprises a protocol that includes at least one acknowledge (ACK) round, and that 

3 performs only local processing between ACK roundsA 

1 20. The program product of claim 1 6 wherein the mam thread performs only local 

2 processing. \ 

1 21 . The program product of claim 16 wherein the main thread does not wait for any 

2 local resource, and thus is guaranteed to receive a message sent by the cluster engine. 

1 22. The program product of claim 16 wherein the signal to the\^ster engine 

2 comprises an unregistration with the cluster engine v 

\ 

\ 
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1 23. The program product of claim 22 wherein the unregistration with the cluster 

2 engine causes the cluster engine to generate a membership change message. 
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1 >24^ A program product comprising: 

\ 

2 (A) a computer program comprising: 

3 \ at least one work thread that performs a predefined protocol that 

4 includes at least one acknowledge (ACK) round, wherein the protocol only 

5 performs local tasks between ACK rounds; and 

6 \ a t ^ irea ^ re §* sters w ^ ^ e °l uster en §i ne t0 become a 

7 member of a group, that receives messages from at least one computer 

8 systemVoupled to the apparatus, that routes appropriate messages from the 

9 at least one computer system to the at least one work thread, and that 

10 signals to the cluster engine when at least one fault occurs when the at 

1 1 least one work thread performs the at least one predefined task by 

12 unregistering with the cluster engine, wherein unregistering with the 

13 cluster engine causes the cluster engine to generate a membership change 

14 to remaining members of the group; and 

1 5 (B) signal bearing media bearing the computer program. 

1 25. The program product of claim\24 wherein the signal bearing media comprises 

2 recordable media. \ 

1 26. The program product of claim 24 wherein the signal bearing media comprises 

2 transmission media. \ 
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